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. Abstract 

O 

We introduce a new statistical method to estimate the primordial helium 

vn ■ 

' abundance Yp from observed abundances in a sample of galaxies which have 



experienced stellar helium enrichment. Rather than using linear regression on 
metal abundance we construct a likelihood function using a Bayesian prior, 
where the key assumption is that the true helium abundance must always 
exceed the primordial value. Using a sample of measurements compiled from 



^ ' the literature we find estimates of Yp between 0.221 and 0.236, depending on 



p 

the specific subsample and prior adopted, consistent with previous estimates 
either from a linear extrapolation of the helium abundance with respect to 
metallicity, or from the helium abundance of the lowest metallicity Hii re- 
gion, I Zw 18. We also find an upper limit which is insensitive to the specific 
subsample or prior, and estimate a model-independent bound Yp < 0.243 at 
95% confidence, favoring a low cosmic baryon density and a high primordial 
deuterium abundance. The main uncertainty is not the model of stellar en- 
richment but possible common systematic biases in the estimate of Y in each 
individual HII region. 
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I. INTRODUCTION 



Big Bang Nucleosynthesis (BBN) makes a clean prediction for the primordial helium 
abundance Yp, depending on only one parameter (the baryon-to-photon ratio rj) and on that 
only weakly. A precise measurement of Yp is therefore necessary to test BBN even in the 
present situation where measurements of other predicted quantities constraining rj, such as 
the primordial lithium abundance, deuterium abundance and the cosmic baryon density, are 
not yet so precise (e.g. Walker et al. 1991, Smith et al. 1993, Sarkar 1996, Fields et al. 1996, 
Hogan 1997ab). 

Even though the bulk of the helium of the Universe originates in the Big Bang, the 
additional helium enrichment by stars cannot be ignored in estimating the primordial abun- 
dance from observations of present-day helium. Unlike the case of deuterium, we do not 
have the option of measuring directly the nearly primordial abundance of ^He at high red- 
shift. Lyman-a absorption by He"*" at high redshift now gives a rough direct estimate of 
primordial abundance (Hogan, Anderson & Rugers 1997), but require uncertain ionization 
corrections to estimate Y. The high precision needed for strong cosmological tests can only 
be attained with high signal-to-noise measurements of nebular emission lines in well char- 
acterized nearby Hii regions (Pagel et al. 1992, Izotov, Thuan, & Lipovetsky, 1994, 1997, 
Skillman et al. 1994, 1997), necessarily requiring measurements in gas with a certain amount 
of stellar helium superimposed on the primordial helium. 

Under these circumstances, what is the best way to estimate the primordial helium 
abundance Ypl It certainly helps to find H ii regions with as httle stellar helium as possible, 
as indicated both by their helium abundances and by the abundances of other heavier 
elements. The best studied example is I Zw 18 (Pagel et al. 1992, Skillman & Kennicutt 
1993, Izotov, Thuan, & Lipovetsky, 1997), with an average estimated helium abundance 
from five independent measurements Y = 0.230 ±0.004 (Olive, Skillman, & Steigman 1997), 
and a metallicity l/50th of solar. In the analysis below, we will use samples drawn from 
the complete sample of 62 individual Hii regions (including two in I Zw 18) compiled by 
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Olive, Skillman, & Steigman (1997) from the data in Pagel et al. (1992), Izotov, Thuan, 
& Lipovetsky, (1994, 1996), and Skillman et al (1994, 1997). This sample represents all 
published Y measurements of Hii regions with 0/H < 1.5 x 10~^ and N/H < 1.0 x 10~^ 
(for comparison (O/H)© = 8.5 x 10"^ and (N/H)© = 1.1 x 10"^). 

One statistical technique to extract a primordial helium abundance from such a sample 
was introduced by Peimbert & Torres-Peimbert (1974) and is still widely used. For each 
galaxy in the sample, a metallicity Z (either 0/H or N/H) and hehum abundance Y are 
measured; one then fits a linear relation between Y and Z and extrapolates to zero metallicity 
to find the primordial value. This technique offers insights into the nuclear evolution of these 
systems, and for relatively large enrichments the extrapolation is empirically well founded. 
In Ohve, Steigman & Walker (1991) and in Olive & Steigman (1995), it was shown that these 
two quantities as measured in extragalactic Hii regions are in fact strongly correlated. The 
stabihty of the resulting fits was tested by a statistical bootstrap in Olive & Scully (1996) 
and in Ohve, Skillman, & Steigman (1997) showing that the results were not particularly 
sensitive to any individual data point. 

However, one of the limitations of this method is the need to assume a linear relation 
between Y and Z, which is not well motivated. For example, helium and oxygen are not 
produced in the same stars; while helium is produced primarily in intermediate mass stars, 
oxygen is produced only in stars with masses ^ 8 Mq. Physically realistic enrichment models 
often include quadratic terms, as well as stochastic variations in enrichment history, including 
different slopes dY/dZ. It is not clear that complicating the fits to include such effects results 
in a value for primordial Yp with any greater statistical significance or reliability. Moreover 
most of the information on the primordial abundance is contained in the lowest metallicity 
points, where the correlation is not very reliably established; this information is not being 
efficiently used in regression fits dominated by highly enriched regions. 

Another approach has been to simply take the lowest, best measured points and use 
them as estimates of (or at least limits on) the primordial abundance. Indeed, it has been 
argued that it may be sufficient to determine the primordial ^He abundance from even a 

3 



single well studied low metallicity H ii region such as I Zw 18 (Kunth et al. 1994), although to 
avoid bias it should be chosen on the basis of low metallicity and not low helium abundance. 
Although these approaches yield results consistent with the linear fit to the data, caution is 
needed to avoid subtle biases if one starts off by selecting a sample by intentionally choosing 
the lowest points. Moreover in a sample of more than just one point we need a statistical 
method to combine the data, which recognizes the real spread in stellar helium enrichment 
and still recovers an unbiased estimate of the primordial value in spite of the obvious "bias" 
that the nonprimordial contributions are always additive. 

We explore here a simple but systematic Bayesian approach to these issues, which we 
believe is the simplest recipe to extract an unbiased primordial abundance estimate from 
the data, free from detailed assumptions about metal enrichment. We aim to assume as 
little as possible — only that there is some universal primordial helium abundance, and that 
all subsequent evolution has increased the abundance. We encapsulate these assumptions 
mathematically with a Bayesian "prior" and derive statistical constraints on the primordial 
abundance which explicitly recognize the bias introduced by stellar enrichment. Inspiration 
for our approach and more background on the statistical methods is given by Press' (1996) 
application of Bayesian arguments to estimates of the Hubble constant. 



II. METHOD AND APPLICATION 

Suppose we have a sample S consisting of a series of abundance measurements 1^ in a 
set of galaxies. In each case i there is some true abundance YiT which we do not measure, 
because of the measurement error. We seek to evaluate the relative probability (or likelihood) 
of obtaining the data in sample given a primordial abundance Yp. 

C{Y,)^P{S\Y,)^'[{P{Y,\Y,) (1) 

i 

where P(li |1^) denotes the probability of obtaining the measurement Y^ given the primordial 
value Y„. For this in turn we write 



4 



P{Yi\Yp) = I dY,TPinYiT)P{YiT\Yp). (2) 

The first term in the integrand is just the distribution of measurement errors in each case. 
For the second term we need to exphcitly construct a Bayesian prior P{YiT\Yp). This "enrich- 
ment probability function" encodes our assumptions about what the distribution of hehum 
abundances in galaxies in our sample ought to be, given a primordial abundance. 

What do we know about P{YiT\Yp)7 The most important property, supported by all 
models of nuclear evolution, is that there is no net destruction of helium: stars can only 
increase the primordial abundance. Thus nowhere can the true abundance be less than 
Yp-. P(YiT\Yp) = for YiT < Yp. At a more detailed level, the shape of this prior depends 
on what we think about the chemical histories of the galaxies in the sample under study; 
there will be some range of Yp of width w for which PlYi^lYp) ^ 0. Fortunately as we shall 
demonstrate, within reasonable limits the detailed shape for Y^t > Yp does not much affect 
the statistical limits on Yp-. the most important thing is its asymmetry about Y^t = Yp. 

Within this framework we can now explore estimates of P{S\Yp), and consequent statis- 
tical constraints on Yp. To start with, (1) assume that the observational errors are normally 
distributed with variances af, and (2) assume the simplest form for PlYi^lYp), a uniform 
distribution or top-hat function of width w, P{YiT\Yp) = c{Q{YiT — Yp) — Q{YiT — {Yp + w))). 
The likelihood function is then given by 

N ri 1 

^(Yp) = n / rfr,^^^e-(^-^-)^/^'^?p(y,^|y,), (3) 

where the product is over the N sample data points. The integral in (||) can be performed 
analytically, resulting in 

r ,y- ) ^ n - n | {^f (^^^j - (^) ) (4) 

The normalization constants Cj are chosen so that each of the individual functions P{Yi\Yp), 
when integrated over Yp, yield unity. To a very good approximation, c = 1/w. For sufficiently 
large w, the first error function is approximately unity. 
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We choose to leave w as a free parameter and maximize the hkehhood in both Yp and 
w — that is, we estimate w from the sample itself. For each choice of (1^, w) we compute the 
likelihood C{Yp, w); the maximum of the likelihood gives the best values of these parameters. 
The relative likelihood of other choices then allow us to derive constraints on Yp and w. This 
can be illustrated in contour plots showing the equal likelihood contours representing 1, 2 
and 3 standard deviations. These are determined by comparing the log of the likelihood to 
its peak value 

ln£ = ln£peak-sV2 (5) 

for s standard deviations. The contours in the figures represent 1, 2, and 3 standard devia- 
tions from the peak value as determined by (^. If a large number of points contribute, the 
s = 2 contour translates in the usual way to approximately 95% confidence. 

As an illustrative exercise it is useful first to consider a sample of just one galaxy, the 
lowest metallicity galaxy I Zw 18 where Y has been measured independently in two distinct 
Hii regions, yielding an average helium abundance [| {Yi,Y2) = .230 ± .004. Because the 
measured values of Yp for the two Hii regions in I Zw 18 are consistent with each other, the 
likelihood function (^) is peaked at w = and prefers a value Yp = 0.230 ± 0.004, equal to 
the average of the two points (see figure la). 

The full power of the method becomes clearer when we employ a larger statistical sample. 
In figures Ib-le, we show the effect of progressively adding more ^He data to our sample. 
The data is ordered by metallicity so that for example, the 11 point set includes the 11 
extragalactic Hii regions with the lowest values of 0/H. Except for sample selection, the 
oxygen abundance is not explicitly utilized. We show the likelihood distributions up to 
what was described Olive, Skillman, and Steigman (1997) as set C corresponding to 32 data 

^Recently, the smallness of the error bars associated with one of the two Hii regions in I Zw 18 
has been questioned due to underlying stellar absorption (Skillman, Terlevich & Terlevich 1997). 
Of course, too much weight should never be given to any one single observation. 
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points. In this case, the hnear extrapolation with respect to 0/H gave Yp = 0.230 ± 0.003. 
The cuttoffs for the various data sets considered here correspond to 0/H <35, 45, 60, and 
85 xlO~^ for the 7, 11, 18, and 32 point sets respectively.^ 

As soon as one goes beyond a 2-point data set, the dispersion in the data (normally 
associated with enrichment correlated with metallicity) produces a likelihood distribution 
which is peaked at a non-zero value for w — yielding an estimate of the stellar helium 
enrichment of the sample. Consider for example the 11-point data set. The peak of the 
distribution occurs at w = 0.017 and at that width, Yp = 0.228 ± 0.003. The individual 
helium abundances in this sample range from 0.225 to 0.251, with a weighted mean 0.237 
± 0.002. The value for Yp determined by the likelihood distribution is significantly below 
the mean value of the data. This bias is familiar from studies of the Malmquist effect: the 
best estimate of the distribution from which points are drawn is in general quite different 
from the distribution of the values for the points themselves. In the present context it is the 
asymmetry of the prior which leads to the apparent bias. 

It is interesting to note that at the 2a level, the data are consistent with a single primor- 
dial value with no subsequent enrichment {w = 0). This corresponds to a conservative 2a 
upper limit, Yp < 0.240. As the number of data points in the sample increases, the peak of 
the likelihood distribution shifts to higher values of Yp. The 2a upper limit however remains 
at = 0, and its value is relatively insensitive to the data set. These results are summarized 
in table 1. 

The most important arbitrary step in the above analysis is in the choice of prior, and 



^The "full" data set of 62 points is not used here. Indeed, when one tries to use this data set, with 
0/H as high as 145 xlO~^, the correlation between Y and 0/H cannot be neglected. Even in the 
32 point set considered here, the correlation is statistically significant. Notice in table 1 that the 
likely value of w increases from the 18 to the 32 point set; larger samples contain more information 
on enrichment but obscure the information on Y„. 
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to calibrate its effect we consider a range of physically plausible possibilities. In fact the 
top-hat function is already quite conservative towards allowing the largest possible upper 
limits on Yp — that is, since very low Y systems in the present Universe, even in the current 
samples of low metal galaxies, are quite rare, it appears that stellar helium enrichment is 
seldom very small. We could however try to bias the result by choosing a different shape 
biased a priori towards smaller (or larger) nonprimordial enrichments. Consider for example 
a sawtooth function prior, P{YiT\Yp) — (Yp + w — Yi^)/w for Yix between Yp and Yp + w 
and otherwise. This stipulates a priori that it is most likely that there has been very 
little helium enrichment, and no enrichment greater than w, thus P rises abruptly to some 
value at YiT — Yp, linearly decreasing for Yp < YiT <Yp-\-w, and zero for YiT >Yp-\-w. 
We will refer to this prior as the negative bias (named for the slope with respect to I^t)- 
Alternatively, we might choose P{YiT\Yp) = {Y^t — Yp)/w for Y^t between Yp and Yp + w 
and otherwise. This assigns a greater probability for larger helium enrichment but still 
no enrichment greater than w] thus P rises linearly from Yp to some value at Y^t — Yp-\-w. 
This is indeed the case for typical galaxy samples, where low enrichment systems are hard 
to find, so this is perhaps the more realistic prior. We refer to this prior as the positive bias. 
Taken together, these priors span the range of reasonable possibihties. 

Likelihood contours for these two additional choices for priors are shown in figures 2 
and 3 for the same subsamples discussed above. For all three priors, and for all the sam- 
ples considered, the goodness of fit of the model — the effective reduced — is less than 
unity, reflecting that the model is an adequate description of the data (and that the random 
observational errors have typically been overestimated.) The differences in the parameter 
estimates from the top-hat case are small, reflecting the fact that the most critical assump- 
tion affecting the results is the one we are most confldent in, namely the assignment of 
P(YiT\Yp) — for YiT < Yp. As is evident from the flgures, the hkelihood contours are 
shifted to the right for the negative bias prior, yielding a higher value for Yp and to the left 
for the positive bias, favoring a lower Yp. However, the 2a upper limit is found to be nearly 
independent of our choice of prior. These results are summarized in tables 2 and 3. 
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III. CONCLUSIONS 



The significant detection of nonzero w reveals that even in these samples of metal-poor 
galaxies, there is evidence for a spread in Y produced by stars. The best fit to the data 
is for values of the primordial abundance in the range 0.221 < Yp < 0.236, depending on 
the specific model and subsample. A 2a upper hmit, Yp < 0.243, holds for all cases; this 
bound is virtually independent of the stellar helium enrichment model. There are enough 
data points entering so that even if individual errors are nongaussian, this limit corresponds 
approximately to a 95% confidence level. 

The statistical evidence from this sample indicates that the internally estimated errors 
are if anything overly conservative, so the quoted limits are also generously estimated. The 
only reasonable way to reconcile the data with a higher value of Yp would be some systematic 
error in common among all the points — that is, an in-common mistake biasing the results 
of all the measurements. Of course, errors of this kind cannot be corrected by any purely 
statistical technique. 

Even with quite simple and conservative assumptions, our analysis yields a final limit 
which is sufficiently precise to overconstrain the Big Bang picture when combined with 
other data. Our limit of 1^ < 0.243 corresponds to a limit on the baryon/photon ratio 
7] < 3.5 X 10~^° and a predicted deuterium abundance D/H > 6.2 x 10~^, in conflict 
with some recent claims (e.g. Tytler et al. 1996) but in accord with others (e.g. Songaila 
et al. 1996). It is also consistent with BBN predictions of the Li abundance (Fields & 
Ohve 1996, Fields et al. 1996). For the observed microwave background temperature, the 
baryon density is predicted to be VL^h^ < 0.013, which begins to conflict with some recent 
estimates based on models of quasar Lyman-a absorption (e.g.. Ranch et al. 1997, Weinberg 
et al. 1997), but accords with other estimates of baryon density (e.g., Hogan 1997ab). An 
overall concordance of the Big Bang picture is certainly possible but will depend on which 
of these datasets "gives way" . It is clear that the rehability of Yp estimates would improve 
significantly with comprehensive studies of even one more region similar to I Zw 18. 
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Tables 



Table 1: Likeliest values and limits for the top-hat prior 



# Regions 


w Yp 


2a 


upper bound 


2 


0.230 




0.237 


7 


0.021 0.226 




0.238 


11 


0.017 0.228 




0.240 


18 


0.011 0.232 




0.241 


32 


0.013 0.234 




0.242 


Likeliest values and limits for the nega1 


# Regions 


w Yp 


2(7 


upper bound 


2 


0.230 




0.237 


7 


0.025 0.228 




0.238 


11 


0.021 0.230 




0.240 


18 


0.013 0.234 




0.241 


32 


0.015 0.236 




0.243 



Table 3: Likeliest values and limits for the positive bias 



# Regions 


w Yp 


2(7 upper bound 


2 


0.230 


0.237 


7 


0.024 0.221 


0.239 


11 


0.020 0.224 


0.239 


18 


0.014 0.229 


0.241 


32 


0.016 0.230 


0.242 
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FIGURE CAPTIONS 



Figure 1: Likelihood function showing la, 2a and 3a contours in the {Yp,w) plane, for a 
top-hat prior of width w. The -|-'s indicate the peaks of the likelihood functions. Results 
for different subsamples are shown in panels la-le, starting with the 2 points of I Zw 18 
and ending with the 32 lowest metal points. 

Figure 2: Same as Figure 1, for the negative bias prior. 

Figure 3: Same as Figure 1, for the positive bias prior. 
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